Hidden Markov Model Based Animal Acoustic Censusing: Learning from Speech Processing Technology

نویسنده

  • Michael T. Johnson
چکیده

Individually distinct acoustic features have been observed in a wide range of vocally active animal species and have been used to study animals for decades. Only a few studies, however, have attempted to examine the use of acoustic identification of individuals to assess population, either for evaluating the population structure, population abundance and density, or for assessing animal seasonal distribution and trends. This dissertation presents an improved method to acoustically assess animal population. The integrated framework combines the advantages of supervised classification (repertoire recognition and individual animal identification), unsupervised classification (repertoire clustering and individual clustering) and the mark-recapture approach of abundance estimation, either for population structure assessment or population abundance estimate. The underlying algorithm is based on clustering of Hidden Markov Models (HMMs), commonly used in the signal processing and automatic speech recognition community for speaker identification, also referred to as voiceprinting. A comparative study of wild and captive beluga, Delphinapterus leucas, repertoires shows the reliability of the approach to assess the acoustic characteristics (similarity, dissimilarity) of the established social groups. The results demonstrate the feasibility of the method to assess, to track, and to monitor the beluga whale population for potential conservation use. For the censusing task, the method is able to estimate animal population using three possible scenarios. Scenario 1, assuming availability of training data from a specific

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...

متن کامل

Speech enhancement based on hidden Markov model using sparse code shrinkage

This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on the independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a Maximum a posterior (MAP) estimator based on Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...

متن کامل

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

متن کامل

Hidden Dynamic Models for Speech Processing Applications

Human speech has a dual nature: the goal of speech is to convey discrete linguistic symbols corresponding to the intended message while the actual speech signal is produced by the continuous and smooth movement of the articulators with rich temporal structures. Such a dual nature has been amazingly utilized by humans in a beneficial way but has presented a big challenge for both speech science ...

متن کامل

Sparse Hidden Markov Models for Automatic Speech Recognition

Stochastic speech recognition has been cast as a natural realization of the compressive sensing problem in this work. The compressed acoustic observations are subword posterior probabilities obtained from a deep neural network. Dictionary learning and sparse recovery are exploited for inference of the high-dimensional sparse word posterior probabilities. This formulation amounts to realization ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008